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ABSTRACT 

We use the mock catalog of galaxies, constructed based on the COSMOS galaxy catalog including 
information on photometric redshifts (photo-z) and SED types of galaxies, in order to study how 
to define a galaxy subsample suitable for weak lensing tomography feasible with optical (and NIR) 
multi-band data. Since most of useful cosmological information arises from the sample variance 
limited regime for upcoming lensing surveys, a suitable subsample can be obtained by discarding 
a large fraction of galaxies that have less reliable photo-z estimations. We develop a method to 
efficiently identify photo-z outliers by monitoring the width of posterior likelihood function of redshift 
estimation for each galaxies. This clipping method may allow to obtain clean tomographic redshift 
bins (here three bins considered) that have almost no overlaps between different bins, by discarding 
more than ~ 70% galaxies of ill-defined photo-z's corresponding to the number densities of remaining 
galaxies less than ~ 20 per square arcminutes for a Subaru-type deep survey. Restricting the ranges 
of magnitudes and redshifts and/or adding near infrared data help obtain a cleaner redshift binning. 
By using the Fisher information matrix formalism, we propagate photo-z errors into biases in the 
dark energy equation of state parameter w. We found that, by discarding most of ill-defined photo-z 
galaxies, the bias in w can be reduced to the level comparable to the marginalized statistical error, 
however, the residual, small systematic bias remains due to asymmetric scatters around the relation 
between photometric and true redshifts. We also use the mock catalog to estimate the cumulative 
signal-to-noise (S/N) ratios for measuring the angular cross-correlations of galaxies between firmer 
photo-z bins, finding the higher S/N values for photo-z bins including photo-z outliers. 
Subject headings: cosmology: theory - gravitational lensing - photometric redshift 



1. INTRODUCTION 

The bending of light by mass, gravitational lensing, 
causes images of distant galax ies to be distorted (e.g. 
iBartelmann fe Schneider! [20011 for a thorough review). 
These sheared source galaxies are mostly too weakly dis- 
torted to measure the effect in individual galaxies, but 
requires surveys containing at least millions o f galaxies 
to de tect the signal in a statistical way (e.g. see lFu et al.l 
120081 for the latest measurement result). This cosmic 
shear is now recognized as one of the most promising 
probe that allows a direct reconstruction of the dark mat- 
ter distribution as well as to constrain the properties of 
dark energy or to test the theory of gravity on cosmo- 
logical scales (e.g. jHoekstra fc Jainl 120081 : iMassev et al.l 
l201CtlH"utererll2010l for recent reviews). In particular, by 
adding redshift information of source galaxies the lensing 
geometrical information as well as the redshift evolution 
of dark matter clustering can be inferred, thereby al- 
lowing to signifi cantly improve its ability of constraining 
cosm ology (e.g. |Hu| 119991 : iHutererl l2002h iTakada fc Jainl 
l200l . 

To address questions about the nature of dark energy 
and/or the properties of gravity on cosmological scales, a 
number of ambitious wide-field optical and infrared imag- 
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ing surveys have been proposed: the Panoramic Survey 
Telescope & Rapid Response System (Pan-STARRS 4 ), 
the Dark Energy Survey (DES 5 ), the Large Synoptic 
Sky Survey (LSST 6 ), the space-based Joint Dark Energy 
Mission (JDEM 7 ), and the EUCLID. However, there 
are several sources of systematic errors inherent in weak 
lensing measurements, and understanding the systematic 
errors is currently the most important issue for a chieving 
the fu ll potential of planned lensing surveys (e.g.. IHutererl 

One of the most dangerous systematic errors is the un- 
certainty in estimating redshifts of source galaxies. Since 
it is practically infeasible to obtain spectroscopic red- 
shifts for the huge number of imaging galaxies (10 8 -10 9 
galaxies for future surveys), redshifts of galaxies need to 
be estimated from multi-band photometry - the so-called 
photometric redshifts (hereafter photo-z). Both statisti- 
cal errors and systematic biases in the relation between 
photometric and spectroscopic redshifts need to be well 
controlled (e.g. a sub-percent level for the bias for future 
surveys) in order not to have any serious biases in cosmo- 
logical para meters comparable with the appare nt statis- 
tical errors (|Huterer et alj|2006t iMa et alj|2006| ). Under- 
standing the properties of photo-z errors is also impor- 
tant in exploring an optimal survey design given the goal 
of achieving the desired level cosmological constraints; 
depth vs number of filters vs area surveyed. Given these 

4 http:/ /pan-starrs. ifa.hawaii.edu 

5 http:/ /www. darkcncrgysurvey.org 

6 http://www.lsst.org 

7 http://jdem.gsfc.nasa.gov 
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research backgrounds there are recent stu dies on photo- 
z requirement studies based on real data (lAbdalla et al.l 
[20081 iLima et~aT1[2008t iCunha et al.l[2009h . 

In this paper we would like to focus on an issue of 
how to identify and remove catastrophic redshift errors 
- the case that photometric redshift is gr ossly misesti- 
mated (also see iBernstein fc Hutererl [20091 for the simi- 
lar study). This can be done by monitoring the posterior 
likelihood function of photo-z estimation for each galax- 
ies. The important fact is that future surveys arc planned 
to use the sample variance limited regime in cosmic shear 
information rather than the shot noise regime in order to 
constrain cosmology. Therefore one can discard a large 
fraction of galaxies whos e photo- z estimations are less 
reliable ([Jain et al.ll2007t) . Thus it would be worth ad- 
dressing how to construct a galaxy subsample suitable 
for lensing experiments. Having such a subsample of 
galaxies with reliable photo- z estimates may also relax 
requirements on a spectroscopic training set to calibrate 
the residual photo-z errors. In this paper we will address 
these issues by using the mock catalog of photometric 
galax ies constructed ba sed on the COSMOS photo-z cat- 
alog (jllbert et al.ll2009| ) that provides the currently most 
reliable photo-z catalog calibrated with 30 bands data 
and spectroscopic subsample. 

This paper is organized as follow. In Section [2j we 
briefly overview the theory of weak lensing. In Section [3] 
we describe the details on how to make our simulated 
mock catalog of photometric galaxies based on the COS- 
MOS data. In Section [4] we use the simulated catalog 
to assess the performance of photo-z estimation assum- 
ing survey parameters on depth and filter set, which are 
closely chosen to resemble the Subaru Hyper Suprime- 
Cam(HSC) Survey. In Section [5] we show the main re- 
sults: we use the simulated photo-z catalog to implement 
hypothetical weak lensing experiment, paying particular 
attention to how to construct a galaxy subsample which 
is defined such that it has minimal impact of the photo-z 
errors on cosmological parameters. Section [6] is devoted 
to summary and discussion. Unless explicitly stated we 
assume the concordance ACDM model consiste nt with 
the WMAP 5-year results (jKomatsu et al.ll2009t) . 

2. PRELIMINARIES 

In this section we briefly review the basics of cosmic 
shear tomography. Throughout this paper we work in 
the context of a spatially flat cold dark matter model for 
structure formation. 

2.1. Convergence Power Spectrum 

Gravitational shear can be simply related to the lensing 
convergence: the weighted ma ss distribution integrated 
along the line of sight (e.g. see iBartelmann fc Schneider! 
120011 for a thorough review and references therein). Pho- 
tometric redshift information on source galaxies allows us 
to subdivide galaxies into redshift bins, enabling more 
cosmological information to be extra cted, which is re- 
ferred to as lensing tomogra phy (e.g. iHul 119991 : iHutererl 
I2002t iTakada fc Bridldl2007l ). In the context of cosmo- 
logical gravitational lensing, assuming the flat-sky ap- 
proximation and the Limber's approximation, the lens- 
ing power spectrum of the i,j -th tomographic bins can 



be expressed as 

3 Jo X 2 {z)H{z) V x J 

where H(z) is the Hubble expansion rate, \ IS the co- 
moving angular diameter distance up to redshift z, and 
P&(k, z) is the three-dimensional matter power spectrum 
at scale k and at redshift z. The lensing weight function 
Wu\ (x) in the i-th tomographic redshift bin, defined to 
lie between the redshifts Zi and Zj+i, is given by 

Wi(z) = ^n m0 H^ gi (z)(l + z), (2) 

and 

n ( v \-)X(z)l dz^— 1 — ,z<z l+ i 

\ 0, z > z.j + i 

where n(z) is the redshift distribution of galaxies, and 
is the average number density of galaxies residing in the 
i-th tomographic bin (or the redshift range z = [z,, Zj+i]): 
fn = dz'n(z'). 

In practice the power spectrum measured from a 
galaxy survey has shot noise contamination arising from 
the finite sampling of galaxy images. Hence the measured 
power spectrum becomes 

C? j (!) = P$(l) + £s$, (3) 

m 

where o~ e is the rms intrinsic ellipticities per component 
and 6fj is the Kronecker delta symbols; 8^ = 1 when 

i = j, otherwise 5 ft- = 0. 

Note that the distribution n(z) appearing in Eq. © de- 
notes the underlying true redshift distribution of galax- 
ies used in lensing analysis. However, the distribution 
needs to be inferred from photo-z information of individ- 
ual galaxies available from multi-color imaging data sets. 
This generally causes biases in the lensing power spec- 
trum in the presence of photo-z errors. As long as tomo- 
graphic redshift bins are broad enough, O(10 7 ) galaxies 
are available in each bin for a Subaru-type survey with 
^1000 degree 2 sky coverage. Therefore the statistical er- 
rors of photo-z are not problematic: the lensing power 
spectrum is primarily sensitive to the mean redshift of 
source galaxies. Instead, a precise knowledge of the mean 
redshift in each tomographic bin is required not to have 
any significant biases in best-fit pa rameters compared t o 
the statistical errors, as studied in lHuterer et al.l (|2006l) . 

To assess the required photo-z accuracies for lensing 
tomography, an important fact we should keep in mind 
is the lensing measurement for planned wide-field surveys 
is not shot noise limited. Hence a significant fraction of 
galaxies with ill-defined photo-z's can be disca rded, with- 
out s everely degrading parameter accuracies ([Jain et al.l 
| 2007f ). With these considerations in mind we will in the 
following address how to define an adequate subsample of 
galaxies for a given multi-color data set. Figure Q] gives 
a quick summary of the impact of redshift uncertainty 
on the lensing power spectrum for no tomography case, 
i.e. a single redshift bin. Here for simplicity we assumed 
the redshift distribution given by the analytical form, 
n(z) oc z 2 cxp[— (z/zq) 2 ] with zq = 1, corresponding to 
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Fig. 1. — The solid curve shows the lensing shear power spec- 
trum assuming the concordance ACDM model and the galaxy red- 
shift distribution with mean redshift (z) = 1.13 (see text for the 
details). The dashed curve shows the resulting spectrum when 
the mean redshift is shifted by 5%, while the dotted curve shows 
the spectrum when the dark energy equation of state parameter is 
changed to ui = — 1.2. The shaded boxes around the fiducial power 
spectrum show the expected la error at each multipolc bins assum- 
ing fig = 2000dcg 2 , fig = 30 arcmin - 2 , and cr e = 0.22 for survey 
area, the average number density of galaxies and the rms intrin- 
sic cllipticitics, respectively. The bottom panel shows the relative 
differences of power spectra with respect to the fiducial spectrum. 

the mean redshift (z) = 2zo/ 's/tt ~ 1.12. (Note that the 
following results are all computed using simulated galaxy 
catalogs that have different redshift distributions). The 
plot shows that a 5% change in the mean redshift causes 
a 10%-level change in the power spectrum amplitude, 
and the amount of the change varies with multipoles due 
to the projection of the nonlinear matter power spec- 
trum. This change can be compared with the effect of 
dark energy equation of state and the statistical errors 
at each multipole bin expected for the power spectrum 
measurement. Clearly such a bias in the mean redshift 
is problematic for planned surveys. 

3. A SIMULATION OF PHOTOMETRIC GALAXY 
CATALOG 

To assess the impact of photo- z errors on cosmic shear 
tomography, we take the following procedure. First we 
simulate a mock catalog of galaxies that contain infor- 
mation on true redshifts, magnitudes in each filters and 
spectral energy distribution (SED) for survey parame- 
ters we consider. Then we estimate photometric redshifts 
(hereafter often photo-z) for each simulated galaxies from 
its colors, yielding the photo- z catalogs. 

A quick summary of the procedures used in making the 
mock photometric catalog is as follows: 

1. Based on the resu lts of COSMOS photo-z catalog 
(jllbert et al.ll2009f >. we first model the redshift dis- 
tribution of galaxies as a function of the i-band 
magnitudes down to i = 25.8 (see § 13.11) . 

2. Use the synthetic galaxy spectral model, GIS- 
SEL98, to generate a set of SED templates for each 
type of galaxy, where the age and star formation 
history are randomly varied (see § 13. 2|) . 
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Fig. 2. — The redshift distribution of our simulated galaxies 
(containing about 10 5 galaxies) as a function of the i-band magni- 
tude ranges as indicated by the labels. Note that the mock catalog 
is constructed so as to reproduce the redshift distribution found in 
the COSMOS galaxies with i < 25. The galaxies with 25 < i < 25.8 
are simulated by extrapolating the COSMOS results down to the 
fainter magnitudes (see text for the details). 



Use the HyperZ code to generate a mock photo- 
metric catalog of galaxies in which the spectral en- 
ergy distribution and redshift are assigned to each 
galaxy. In doing this the catalog is made by im- 
posing the conditions that the catalog satisfy the 
redshift-magnitude relation as well as reproduce an 
appropriate mixture of galaxy SED types which is 
consistent with the COSMOS galaxy population 
(see 8153)1. 



For a given set of filters, compute apparent mag- 
nitudes in each filter for each simulated galaxy by 
taking into account the filter transmission curve 
and the redshiftcd spectrum (see § I3.4|> . The sta- 
tistical magnitude errors are also added to the mag- 
nitude of each filter (see 



To make a realistic simulated catalog, we assume survey 
parameters (depth, filter transmission curves, and so on) 
that are chosen to well resemble the planned HSC survey. 
We also consider the external imaging data sets of it-band 
and/or NIR to study how combining the different colors 
improves photo-z accuracies. 

In this paper we use the mock galaxy catalog contain- 
ing about 10 5 galaxies in the range 20 < i < 25.8. 

In the following subsections we will describe the de- 
tails of each procedure above, and a reader who is more 
interested in the results can skip these subsections and 

go to §m 

3.1. Magnitude- Redshift Relation 

To make a mock catalog we need to properly take into 
account the redshift distribution of galaxies, which varies 
with the range of magnitudes considered. For exam- 
ple, fainter galaxies are preferentially at higher redshifts. 
This is the so-called magnitude-redshift relation. We use 
the magnitude-redshift relation estimated from the COS- 
MOS catalo g of galaxies select ed by the Subaru i-band 
magnitudes (jllbert et aLll2009D . The COSMOS catalog 
provides currently the most accurate photometric red- 
shifts because the photo-z arc estimated from 30 broad, 
intermediate, and narrow bands covering from UV, op- 
tical to mid infrared. Also the photo-z e stimates are 
calibr ated by the spectroscopic sub-sample. Illbert et al.1 
(|2009[ ) studied subsamples of photo-z galaxies for differ- 
ent limiting magnitudes and showed that the resulting 
redshift distributions are well fitted by the polynomial 
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Fig. 3. — Plotted are synthetic galaxy spectra, f v = \ 2 f\ 
normalized at A = 4000A. For each panel, from top to bottom 
at 10000A, galaxy ages are lOGyr, 5Gyr, lGyr, lOOMyr, lOMyr 
and lMyr, respectively. From the top-left to bottom-right panels, 
the galaxy SED types are star-burst (SB), elliptical (Ell), irregular 
(Im), and spiral (Sc), respectively. 

form: 

n{z)=A'^—, (4) 
z° + c 

where A, a, 6 and c are the fitting parameters. The 
best-fit parameters for different mag nitudes in the rang e 
i = [22,25] arc given in Table 2 in lllbert et "all (|2009l ). 
The COSMOS data is deep enough in the i-band, and 
safely considered as a magnitude limit sample for i < 25. 
The COSMOS catalog also includes information on the 
angular number counts of galaxies for a given magnitude 
range as well as on the estimated galaxy SED type for 
each galaxy. To model a hypothctically deeper survey we 
are interested in, we extrapolate the fitting parameters to 
obtain the redshift distribution for fainter galaxies down 
to i = 25.8. 

We thus generate a mock i-band photometric catalog 
of galaxies such that the resulting catalog satisfies the 
magnitude and redshift relation for different ranges of i- 
band magnitudes. Figure [5] shows the magnitude-rcdshift 
distributions for the simulated catalog containing about 
10 5 galaxies. 

3.2. Synthetic Spectral Models 

For a given simulated galaxy labeled with some i-band 
magnitude and redshift z, we need to model the spectral 
energy distribution from which the apparent magnitudes 
can be computed for a given set of filters. We use the 
publicly available library GISSEL 98 (jBruzual fc Chariot! 
ll993tl2003l;lBolzonelia et al.ll2000D to model the synthetic 
galaxy spectrum. The galaxy SEDs are generated to rep- 
resent from early- to late-type SEDs (elliptical, SO, Sa, 
Sb, Sc, Sd, Im and starburst). To model these popula- 
tions - composite stellar populations (CSPs) - the single 
stellar population (SSP) is convolved with a model star 
formation history: 

/csp(*)= / Ht-r)f SSP (r)dr. (5) 
Jo 



Note that the SSP is modeled in iBruzual fc Chariot] 
(119931) . with t h e init ial stellar mass function given in 
iMiller fc Scalol (|1979t) . The function ip(t) is the star 
formation rate at galaxy age t. We assumed %j){t) cx 
cxp(-t/r) with t = 1,2, 3, 5, 15, 30 Gyr for elliptical, SO, 
Sa, Sb, Sc, and Sd galaxies, respectively. For a starburst 
galaxy, the star formation is instantaneously occurred, 
while ip = constant for an irregular (Im) galaxy. The 
metalicity is self-consistcntly evolved with galaxy age, 
and we checked that different models o f metalicity little 
chang e the photo- z estimates (also see iBolzonella et al.l 
l2000f l. We randomly chosen the age of each simu- 
lated galaxy from 221 different ages in the range of 
t = [0, 20] Gyr, where the age interval is done accord- 
ing to GISSEL98. 

Figure [3] demonstrates simulated SEDs for starburst, 
elliptical, irregular, and spiral galaxies for 6 differ- 
ent ages. The du st extinction is modeled following 
ICalzetti et all (|2000f ) with A v in the range [0,2.0]. 

3.3. Mock Galaxy Catalog 

To make a mock galaxy catalog containing various 
galaxy populations, we used the publicly available code 
Hyper^ . In doing this we need to account for an ap- 
propriate mixture of different galaxy SED types. We 
employed the composition (SB, E, S, Im)=(0.52, 0.035, 
0.40, 0.045) over all the redshift range, which is chosen 
so as to match the composition of best-fit galaxy SED 
types found from the COSMOS catalog with i < 25 9 . 
For this purpose the command make-catalog in HyperZ 
was slightly modified in such a way that the resulting 
catalog satisfies the magnitude-redshift relations for each 
magnitude range in § 13.11 and the assumed composition 
of galaxy SED types, because the original make-catalog 
generates a catalog that redshift, reference magnitude, 
age, galaxy SED type and the amount of dust extinction 
are randomly assigned to each galaxy. 

3.4. Photometric Magnitudes 

Once the spectral energy distribution is specified for 
each simulated galaxy at redshift z, it is straightforward 
to compute the apparent magnitudes for a given set of 
filters taking into account the redshifted spectrum at ob- 
served wavelengths. The photo- z estimate is sensitive to 
the details of observational parameters: the wavelength 
coverage, the transmission curve of a given filter, the 
exposure time, the limiting magnitude, and so on. We 
consider the parameters that match those of the planned 
Subaru HSC survey: our default filter set is g'r'i'z'y' 
(hereafter the prime superscripts are sometimes omit- 
ted), and the 5er limiting magnitudes (2" aperture) are 
set to g = 26.5, r = 26.4, i = 25.8, z = 24.9, and 
y = 23.7, respectively, assuming the exposure time of 
15 minutes for each pass-band, 3 days from new moon, 
and 1.2 airmass at the Subaru Telescope site 10 . 

We also study how adding other bands, it-band data 
and NIR data, into the optical data above can improve 
photo- z accuracies. Having a wider wavelength cover- 
age helps break degeneracies in photo- z estimates, more 

8 http://webast.ast.obs-mip.fr/hyperz/ 

9 http://cosmos.astro.caltech.edu/data/index.html 

10 The details can be found from 
http: / /www. nao j.org/Observing/lnstruments/SCam/index. html 

and l lMivazaki et al.H2UUli ) 
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TABLE 1 

Filters and limiting magnitudes (5<t) 



Filter 


v\rr , \r 




J_ V V J- i-lvl 1 A J 


A R TY1 H CT 


± CXp ^CL J 




CFHTLS 


3752 


740 


25.0 


900 


<?' 


HSC 


4814 


1120 


26.5 


900 


r' 


HSC 


6279 


1370 


26.4 


900 


i' 


HSC 


7687 


1500 


25.8 


900 


z> 


HSC 


9143 


1330 


24.9 


900 


y' 


HSC 


9923 


490 


23.7 


900 


J 


VIKING 


12578 


1713 


22.1 


420 


H 


VIKING 


16581 


2828 


21.5 


420 


K 


VIKING 


21790 


2828 


21.2 


420 



exactly helps discriminate the Lyman break and 4000 
angstrom break, from the multi-color data. We here 
consider the u-band data that can be delivered from 
CFHT, and also the J, H, K s (hereafter K) bands of 
planed VIKING (VISTA Kilo-Degree Infrared Galaxy) 
survey. The 5rx limiting magnitudes are u = 25.0, 
J = 22.1, H = 21.5 and K = 21.2, respectively. The 
set of filters and the depths we consider in this paper are 
summarized in Table [T] 

Detailed study for the optimal filter parameters, for 
example, the number of filters, filter resolution, in terms 
of mini mizing the out l ier fra ction or photo-z scatters are 
found in lJouvel et"aH (|2010f ). 

3.5. Magnitude Errors 

Finally we include statistical errors in the apparent 
magnitudes. Assuming the sky noise limit, we simply 
model this magnitude error as Gaussian fluctuations with 
width given by 

Am -2.5 log + , (6) 

where SN is the signal-to-noise ratio for a given galaxy; 
SN is computed once its apparent magnitude and the 
depth in the filter are given. The magnitude error is 
computed as follows. First, the sky noise is added to 
the observed flux of a galaxy, causing a deviation from 
the true flux as / obs = f + Af = / (1 + l/SN). 
Then the magnitude error above is computed as Am = 
-2.51og(l + l/SW) because m + Am = -2.51og/ (l + 
l/5iV)+constant. Exactly speaking, even for a Gaussian 
sky noise, the magnitude error does not obey a Gaus- 
sian distribution due to the log-mapping. However, the 
Gaussian approximation on Am holds for galaxies with 
sufficiently high SN values, which we will assume for the 
following results. 

Note that a galaxy, which has its apparent magnitude 
near the limiting magnitude, may be excluded from or 
included in the sample in the presence of the magnitude 
error. While our simulated galaxies are all z-band se- 
lected, some galaxies may have apparent magnitudes be- 
low the detection limit in other pass-bands. We use such 
an upper limit on the flux in the photo-z estimate, which 
improves the photo-z's to some extent. In doing this the 
flux for such an undetected galaxy in a given filter is 
set to the magnitude corresponding to the halved limit- 
ing flux /ii m /2. Also we note that the systematic offset 
of photometry may cause an additional uncertainty on 
magnitude measurement, which in this paper is ignored. 
Such a zero-point magnitude offset can be, for example, 



calibrated by usin g a spectroscopic redshift subsamplc 
(|Ilbert et al.l[200l) . 

According to the procedures described above we made 
a mock photometric catalog containing about 10 5 galax- 
ies down to the magnitude i = 25.8. Although we tried 
to make a realistic mock catalog based on the COSMOS 
catalog, some of our treatments may be still oversimpli- 
fied: for example, we assumed a single stellar population 
for galaxy SEDs. The simplified assumptions may make 
our results somewhat optimistic. A more accurate way to 
overcome these obstacles is using the real data including 
spectra for a representative subsamplc of imaging galax- 
ies. However, such a spectroscopic data especially for 
faint galaxies of interest is still limited, and this is our 
future work. 

4. METHOD: PHOTOMETRIC REDSHIFT AND 
PARAMETER BIAS 

Now we use the mock photometric catalog of i-band 
selected galaxies to assess the performance of photo-z 
estimates in the context of weak lensing tomography ex- 
periment. 

4.1. Photometric Redshift Estimation 

By combining multi-passband magnitudes of a given 
imaging galaxy, its redshift can be estimated with- 
out spectroscopic observation - the so-called photo- 
z. There are various techniques th at have been devel- 
oped: the template fit ting method (|Sawicki et al.|[l997t 
iBolzonella et al.l 120001) . the template method combined 
with prior information (magnitud e prior and so on) 
(|Bemtezl2000HMobasher et al.ll2004l) . the method includ- 
ing a self-calibration base d on a training spectroscopic set 
(|Collister &; Lahavll2004 and see references therein). 

In this paper we use the publicly available code, Le 
Phare 11 , which is a template fitting method. The photo- 
z for each galaxy is estimated based on the \ 2 fitting: 

a ^ [f° hs -amz,E)] 2 

i 1 

where /° bs is the observed flux in the z-th filter, f(T, z, E) 
is the model flux which is given as a function of galaxy 
SED type (T), redshift (z) and the amount of dust ex- 
tinction (E), and rjj is the magnitude error. Note that 
galaxy SED type is modeled according to the method de- 
scribed in § 13.21 The summation runs over the number 
of filters considered (Nf). The extra factor parameter 
a, which is the same in all the filters, is introduced in 
Eq. ([7]) because the photo-z is estimated only from col- 
ors, the relative amplitudes of fluxes in different filters, 
not from the absolute fluxes. Therefore there are (Nf- 
1) constraints given the data of Nf filters. The best-fit 
redshift parameter, i.e. the best-fit photo-z, is obtained 
by minimizing the \ 2 value with varying other model 
parameters. 

If the location of spectral features such as Lyman break 
and 4000A break is captured given the wavelength cov- 
erage of observed filters and the magnitude depths taken, 
the redshift is robustly estimated. On the other hand, 
a misidentification of the spectral features causes a de- 
generacy in redshift estimation, often yielding multiple 

11 http: / /www.cfht. hawaii.edu/~arnouts/lephare. html 
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solutions at low and high rcdshifts. Hence the photo-z 
method based on broad band photometry generally yields 
a large fraction of outliers, where the best-fit photo-z can 
be far from the true redshift. To quantify the photo-z 
accuracy for each galaxy we will use the following two 
quantities: (1) the goodncss-of-fit parameter for the tem- 
plate fitting, and (2) the width of likelihood function of 
redshift estimation. 

The goodness-of-fit for the template fitting of a given 
galaxy may be defined as 



2 _ Xmin 

is the minimum 



(zbf) 



(8) 



where Xmin * s the minimum \ z value for the best-fit 
model and redshift, Zbf is the best- fit redshift and Nf — 1 
is the number of colors available. Note that the quantity 
above, x^j is n °t the same as the reduced x 1 , which is 
defined as the number of constraints minus the number 
of model parameters. The number of model parameters 
are equal for all the galaxies, so \1 gives a measure of 
the goodness-of-fit. 



0.8 
0.6 



0.8 



Var(z)=1.1921 



Var(z)=0.54BB7E-01I 



redshift 

Fig. 4. — Examples of the photo-z fitting for two simulated 
galaxies. The upper panel shows an example of the photo-z outlier, 
i.e. ill-defined photo-z estimate, while the lower panel shows an 
example of the reliable photo-z estimate. In each panel, the solid 
curve shows the likelihood function of redshift parameter, p(z) oc 
exp[— x 2 /2], and the solid (red) and dashed (blue) arrows denote 
the best-fit redshift and the true redshift, respectively. The width 
of the likelihood is quantified by Var(z), defined by Eq. l[9}, and 
the value for each simulated galaxy is denoted in the upper-right 
corner of each panel. 

We also use the width of likelihood function of redshift 
estimation for each galaxy defined as 



Var(z) 



dz (z - z hi fp{z) 



1 



(1 + z u ) ' 



(9) 



where p(z) is the likelihood function given as p(z) cx 
exp[— x 2 (z)/2], which is normalized so as to satisfy 
Jdz p(z) = 1. We compute the likelihood function p(z) 
by fixing other model parameters to their best-fit val- 
ues. The normalization factor (1 + Zbf) is introduced 
based on the fact that the photo-z accuracy scales with 
(1 + z). Compared to c(z), the local standard deviation 



of photo-z estimation, which is obtained from A% 2 < 1, 
the quantity Var(z) is sensitive to the outlier probability 
with | z — Zbf | 3> 1 due to the weight (z — Zbf) 2 . Hence for 
galaxies whose likelihood function has multiple peaks, i.e. 
multiple redshift solutions, the quantity Var(z) tends to 
be larger. The similar quantities to Var(z) are also use d 
in the previous works (|Mobasher et al.ll200l IWoll2009l) , 
where the primary purpose is to improve the photo-z per- 
formance for a majority of galaxies. In this paper we use 
the figurc-of-mcrit quantity Var(z) mainly for identifying 
photo-z outliers. 

Note that we will in the following results use Zbf 
to construct the redshift distribution of galaxies. An 
alternative method, which may be less sensitive to 
photo-z outliers, is summing the photo-z posterior like- 
lihood function p{z) over all the sam pled galaxies to 
obtain the overall redshift distribution ([Wittmanl 120091 : 
ICunha et~a l. 2009). Calibrating the mean redshift distri- 
bution is another impo r tant issue to be caref ully studied 
(|Ma k Bernstein! 120081 : iBordoloi et "aDl2009t l. but is be- 
yond the scope of this paper. 

Figure |4] demonstrates examples of the photo-z fitting 
for two simulated galaxies. It can be found that a galaxy 
which has the ill-defined photo-z estimate, i.e. the wider 
likelihood function of redshift estimation, tends to have 
a larger value of Var(z). In particular, even if the like- 
lihood function locally has a narrow peak around the 
best-fit redshift, therefore even if the photo-z error looks 
apparently small, the value Var(z) becomes larger if the 
likelihood has multiple peaks (i.e. the case of multiple 
redshift solutions), as demonstrated in the upper panel. 
On the other hand a galaxy with reliable photo-z esti- 
mate has a small value of Var(z). 

While the quantity Var(z) is rather empirically defined 
as an indicator of photo-z outliers, Figure Ogives a quan- 
titative study of how Var(z) can characterize the photo-z 
likelihood function. The figure shows the distributions of 
simulated galaxies in the Var(z)-Az plane, where Az is 
the difference between true and photometric redshifts de- 
fined as Az = (zbf — z truc )/(l + zt rU e) for each galaxy. Ac- 
cording to the properties of their photo-z likelihood func- 
tions, we divide galaxies into two subsamples: one (black- 
thick line) is defined from galaxies (about 25% fraction 
of all the galaxies) that have a single peak therefore a re- 
liable photo-z estimate, while the other (gray-thin line) 
is from galaxies of multiple peaks, respectively. Note 
that the second and higher-order peaks are defined from 
local maxima of the likelihood that have heights higher 
than 10% of the first peak height. One can find from the 
figure that the quantity Var(z) nicely separate galaxies 
that tend to have have greater photo-z biases and mul- 
tiple solutions of photo-z's, i.e. degenerate photo-z esti- 
mate; most of galaxies having Var(z) 0.1 have multiple 
peaks. 

4.2. Fisher matrix formalism 

We will use the Fisher matrix formalism to estimate ac- 
curacies of estimating parameters given the lensing power 
spectrum. The Fisher matrix is given by 



F 



a0 



E 



( i ,j,m,n 



dp 



dpp 
(10) 
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iJoachimi fc Sc hneider 2009, for the detailed derivation): 



dem 



Var(z) 



Fig. 5. — The probability distributions of simulated galaxies in 
parameter space of Var(z) and Az, where Var(z) is defined in terms 
of the photo-z likelihood by Eq. Q and Az denote the bias between 
photometric and true redshifts, Az = (zj,f — ztruo)/(l + ztruc). In 
each panel, black-thick dots or lines show galaxies whose photo-z 
likelihood function has a single peak, while gray-thin dots or lines 
show galaxies that multiple peaks in their likelihood. It is clear 
that galaxies, which have greater photo-z biases and have multiple 
photo-z solutions in their likelihood, tend to have greater values of 
Var(z). 



where p a denotes a set of cosmological parameters, the 
matrix C denotes the covariance matrix, and C -1 de- 
notes the inverse matrix. In this paper we simply use 
th e covariance ma t rix gi ven by the first term in Eq. (9) 
m ITakada fc Jam] (|2009t ) assuming the Gaussian errors 
on power spectrum measurements. The Gaussian er- 
ror assumption is adequate for our purpose because the 
impact of no n- Gaussian errors on parameter estimation 
is not significant as long as a multi-parameter fitting 
is considered as shown in |Takada fc Jain! (|2009l ) . The 
marginalized la error on the a-th parameter p a is given 
by a 2 (p a ) = (F _1 ) QQ ,, where F _1 is the inverse of the 
Fisher matrix. Throughout this paper we set l m [ n = 5 
and l max = 3000 as for the minimum and maximum mul- 
tipoles in the summation above. Note that all the pa- 
rameter forecasts shown below are for the lensing tomog- 
raphy combined with the expected Planck information, 
which is obtained simply by adding the two Fisher ma- 
trices of lensing and CMB: F wl +cmb = ^wl + -Fcmb- 
As explained around Eq. ([T]), the lensing power spec- 
trum is sensitive to the underlying true redshift distri- 
bution of galaxies, n(z). For a multi-color imaging sur- 
vey, however, the distribution n(z) needs to be estimated 
from the available photo-z information. In this proce- 
dure the photo-z errors affect weak lensing experiments. 
Most dangerous effect is a systematic bias in parameter 
estimations: if the inferred redshift distribution has a 
bias in the mean redshift compared to the true one, the 
redshift bias may cause significant biases in cosmologi- 
cal parameters. In order to quantify the biases in cos- 
mological parameters caused by photo-z errors, we use 
the fol lowing method based on t he Fisher matrix formal- 
ism in lHuterer fc Takadal (|2005l ) (also see Appendix B of 



k. photo— z / gl 



(*)] , 
(11) 



where F," 



is the inverse of the Fisher matrix, 



WL+CMB 

and Sp a denotes a bias in the a-th parameter, the dif- 
ference between the best-fit and true values. In the 
equation above, the spectrum C!^ n (l) is the underlying 
true power spectrum, while C^ hoto ~ z (l) is the spec- 
trum estimated from the redshift distribution inferred 
based on the photo-z information. In the presence of 
the photo-z errors, generally Cg ^ C^ photo ~ z , thereby 
causing a bias in parameter estimation. We can com- 
pute both spectra, C K and C K, ' phot °~ z from a simulated 
galaxy catalog for a hypothetical lensing survey. Note 
that in Eq. (fTTj). for simplicity, we have not considered 
any other nuisance parameters that model other system- 
atic effects such as the shape measurement errors (e.g. 
iHuterer et al.l 120061 ) and the inability to make precise 
model predictions arising from nonlinear clustering and 
baryonic physics (iHuterer fc Takadal l2005t iRudd et"aTI 
l2008HZentner et alll2008f ). 

To compute the parameter forecasts we need to spec- 
ify a fiducial cosmological model and survey parameters. 
Our fiducial c osmological model is b ased on the WMAP 
5-year results (|Komatsu et al.ll2~009[) : the density param- 
eters for dark energy, CDM and baryon are nd e (= 0.74), 
n cdm h 2 (= 0.1078), and fL h h 2 (= 0.0196) (note that we 
assume a flat universe); the primordial power spectrum 
parameters are the spectral tilt, n s (= 1), and the nor- 
malization parameter of primordial curvature perturba- 
tions, A s = S 2 (= 2.3 x 10~ 9 ) (the values in the paren- 
theses denote the fiducial model); the dark energy equa- 
tion of state parameter wq(= —1). W e used the publicly 
available code CAMB developed in iLewis fc Challinorl 
(|2006h (also see ISeliak fc Zaldarriagalll99l " to compute 
the transfer funct ion, and use the fitting formula in 
iSmith et al.l ([2003) to compute the nonlinear mass power 
spectrum from which the lensing power spectrum is com- 
puted over the relevant range of angular scales. 

Our fiducial survey roughly rese mbles the planned Sub - 
aru Hyper-Suprime Cam Survey (jMivazaki et al J 120061 ). 
We adopt the set of filters (grizy) and the depths in each 
filter as given in § [3j We will also study how the results 
change when the hypothetical Subaru survey is combined 
with other surveys that deliver the it-band data or/and 
the NIR data, which especially help improve the photo-z 
accuracies. The survey area is throughout assumed to 
be f2 s = 2000 deg 2 . The redshift distribution of galaxies 
is computed for an assumed subsample of galaxies based 
on the photo-z information. 

4.3. Object selection and clipping of photo-z outliers 

We may be able to construct a suitable subsample of 
galaxies in a sense that the impact of photo-z errors are 
minimized in order not to have more than 100% biases 
in cosmological parameters compared to the statistical 
errors. Hence a selection of adequate galaxies is impor- 
tant for weak lensing: for example, this may be attained 
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by discarding galaxies with ill-defined photo-z's. How- 
ever, the important fact we should keep in mind is that, 
with discarding more galaxies, the statistical accuracy of 
parameter estimation is degraded due to the increased 
shot noise contamination in the power spectrum mea- 
surement. Thus there would be a trade-off point in defin- 
ing a suitable galaxy subsample in terms of the parameter 
bias versus the statistical error for a given survey. 

We throughout this paper work on i-band selected 
galaxies assuming that the i-band data is used for the 
lensing shape measurement as often done in the previ- 
ous lensing works. For our simulated galaxies, given the 
limiting magnitude i = 25.8 at 5<7 significance, a suf- 
ficiently number of photometric galaxies are available: 
the number density for total galaxies is 80 per square 
arcminutes. However, all the galaxies are not usable of 
lensing measurements. First, in order to obtain a reliable 
shape measurement, galaxies used in the lensing analysis 
need to be well resolved, requiring the galaxies to have 
sufficient signal-to- noise ratios (say more than 10er). Sec- 
ondly, galaxies with ill-defined photo-z estimates are not 
useful because including such galaxies may cause a sig- 
nificant bias in parameter estimation. 

With the considerations above in mind, we will con- 
sider the following object selections or their combinations 
to make parameter forecasts. 

• The restrictive range of i-band magnitudes: 22.5 < 
i < 25. The range is a typical one used in the 
weak lensing analysis fe.g. lOkabe et al.|[2009f ). The 
faint-end magnitude cut may be imposed such that 
the selected galaxies have sufficiently high signal- 
to-noise ratios: i = 25 corresponds to S/N ~ 10 in 
our simulations. The bright-end magnitude cut is 
not important, but usually imposed in practice to 
avoid galaxies with saturated pixels. 

• The photo-z selection. We select only galaxies that 
have reasonably good photo-z estimates by impos- 
ing a threshold on the goodness-of-fit of photo-z 
estimation, \1 < 2 (sec Eq. [8]). The clipping 
threshold xt — 2 is not a unique choice. Rather 
we selected the value, as one working example. 

• The restrictive range of photo-z's: 0.2 < Zbf < 1.5. 
The spectral features of galaxies in this range of 
redshifts can be relatively well captured by the 
wavelength coverage of optical filters. The lower 
redshift cut is introduced, because there is a strong 
degeneracy between galaxies at such low redshifts 
z 0.2 and those at higher redshifts, especially in 
a case that the u-band data is not available or shal- 
lower than optical data as considered in this paper. 

• Clipping of photo-z outliers. By discarding galax- 
ies with Var(z) (see Eq. [9]) greater than a given 
threshold, which turn out to be mostly photo-z 
outliers, we define a subsample from the remain- 
ing galaxies that have relatively reliable photo-z's. 
In the following we study the performance of this 
clipping method by varying the clipping threshold 
values of Var(z). 

Table [5] gives the fraction of remaining galaxies com- 
pared to the original sample, where galaxies in each 



TABLE 2 

Fractions of galaxies included in each subsample 



Filters 


Xi<2 


no.2 < z p < 1.5 


D22.5 < i' < 25.0 


grizy 


0.94 


0.61 


0.50 


ugrizy 


0.94 


0.61 


0.50 


grizy J H K 


0.89 


0.57 


0.47 


ugrizy J H K 


0.89 


0.58 


0.48 



Note. — The subsamplcs denoted as "xj < 2" show included 
galaxies selected with X? ^ 2 in the photo-2 fitting for each com- 
bination of filters (see text for the details). The second and third 
columns show the results when further imposing the conditions on 
the ranges of photo-z's and i-band magnitudes as denoted. 



subsample are selected with object selection criteria de- 
scribed above for a given set of filters. The column de- 
noted by u x1 < 2" shows the fraction of galaxies se- 
lected when imposing the threshold on the goodness-of- 
fit for each galaxy. It is clear that this clipping discards 
only a small fraction of galaxies for all the cases of filtcr 
combinations. The second and third columns show the 
fractions when further imposing the restricted ranges of 
photometric redshifts and magnitudes, respectively. 

5. RESULTS 

In this section we show the main results of this paper 
using mock galaxy catalogs. 

5.1. Photo-z Accuracy 

Figure |6] shows the results using different galaxy cat- 
alogs with various combinations of the measured pass- 
bands (see Table [T]), grizy, ugrizy, grizy+JHK and 
ugrizy+JHK from the left- to right-column panels, re- 
spectively. Each panel in the top row shows the photo- 
z performance for the simulated objects with i' < 25.8 
(> 5er), but selected by imposing the condition on the 
goodness-of-fit xt < 2 (see Eq. [8|). It is clear that, 
with broadband photometry alone, the photo-z accuracy 
is limited: a significant contamination of outliers is un- 
avoidable, even with including NIR- and/or it-band data. 

The middle- and lower-row panels show the results ob- 
tained by further discarding photo-z outliers based on 
the clipping method with a given threshold on the quan- 
tity Var(z) (Eq. [9]). The thresholds are chosen such 
that 40% or 70% of the objects in each top-row panels, 
which tend to have ill-defined photo-z's, are discarded, 
respectively. It can be found that the clipping method 
based on the photo-z likelihood function of each galaxy 
can efficiently discard galaxies with ill-defined photo-z's, 
especially when combined with the NIR and u-band data. 

To be more explicit, the upper panel shows how much 
fraction of galaxies are included in the subsample when 
discarding galaxies with Var(z) greater than a given 
threshold denoted on the vertical axis. The solid and dot- 
ted curves show the results without and with the JHK 
data in addition to the optical data, grizy. Note that 
including the CFHT-type u-band data of the assumed 
depth, as given in Table Q] little changes the two results. 
For a given particular value of Var(z), the dotted curve 
has more remaining galaxies than the solid curve, imply- 
ing that these NIR data help improve the photo-z accu- 
racies on individual galaxy basis, i.e. indicating that the 
shape of photo-z likelihood function becomes narrowed 
by adding the NIR data for most of galaxies. 



A clipping method to mitigate the photo- z errors 



9 




Q.3 0.4 0.6 0.H 1 

ratio 




01230123012301234 



'spec 

Fig. 6. — Lower plot: The scatter plots between photometric 
and true redshifts for our simulated galaxies. The different columns 
correspond to the results for different sets of filters as indicated in 
the bottom panels, while the different rows correspond to different 
object selections (see § 14.31 for the details). The upper-row panels 
show the results of samples containing all the galaxies with i < 25.8 
that have their photo-z fits quantified as xi < 2 (sec Eq. [8]). The 
middle- and bottom-row panels show the results of subsamples ob- 
tained by discarding 40% and 70% galaxies with ill-defined photo- 
z estimates, respectively. This is done by choosing the threshold 
value Var(z) (see Eq. [9]) for each galaxy such that the desired frac- 
tions of galaxies are remained in the resulting subsamples. Note 
that, for illustrative purpose, only 5% representative galaxies in 
each subsample are shown in each plot. Upper panel: The ratio 
of remaining galaxies that have Var(z) values greater than a given 
threshold denoted on the vertical axis. The solid curve shows the 
result for the set of filters, grizy, while the dotted curve shows the 
result when the NIR data JHK are added (see Table [l] for the 
details). Note that the results are almost unchanged by further 
adding the it-band data. The vertical thin dashed lines denote the 
selections used in the lower plot, the criteria discarding 40% and 
70% galaxies with poor photo-z's. 
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Fig. 7. — Similar to Figure \E\ but for the different range of 
i-band magnitudes for object selection, 22.5 < i < 25. 



Figure [7] shows the similar result to the previous figure, 
but for brighter samples of galaxies, selected with i < 25 
or equivalently with S/N values greater than 10er in i- 
band. These brighter galaxies are more suitable for the 
accurate shape measurement as discussed in § 14.31 One 
can find that the photo- z accuracy is improved compared 
to Figure [SI 

5.2. Simulating Leasing Tomography: The Impact of 
Photo-z Errors 

We are now in a position to use the photo-z galaxy cat- 
alogs, constructed up to the preceding section, to make 
the trade-off analysis on the number of galaxies within 
a sample versus how "clean" the tomographic rcdshift 
intervals are. Then we study the impact of photo-z er- 
rors on parameter estimation assuming the hypothetical 
lensing tomography experiment. 

Figure [H] shows the redshift distributions of each to- 
mographic bin made by subdividing the photo-z galaxies 
into 3 intervals of photometric redshifts, z p < 0.8, and 
0.8 < z p < 1.5 and z p > 1.5. The redshift intervals are 
chosen such that each redshift intervals contain similar 
number densities for the original mock catalog of galax- 
ies. Note that the redshift binning is fixed in the follow- 
ing analysis for simplicity, which helps to compare the 
results of different galaxy catalogs. Also note that three 
rcdshift bins are a minimal choice of lensing tomography 
for constraining the dark energy equation of state param- 
eter 'W to a reasonable accuracy by efficiently breaking 
para meter degeneracies i n the lensing power spectrum 
(e.g. iTakada fc Jah3l2004h . 

The upper plot in Figure [5] shows the tomographic 
rcdshift distributions constructed from different photo- 
z catalogs in Figure [6j where different panels correspond 
to the different sets of filters and the different clipping 
thresholds. The shaded regions in each panel show the 
photometric redshift distributions of galaxies, which have 
sharp cutoffs in the distributions due to the sharp red- 
shift binning, while the solid line histograms show the 
underlying true redshift distributions. As can be found 
from the top-row panels, if all the galaxies are used, the 
resulting redshift distributions have significant overlaps 
between different redshift bins due to a significant con- 
tamination of photo-z outliers for any combinations of 
filters. On the other hand, the middle- and bottom-row 
panels show that, when 40% or 70% of photo-z outliers 
are discarded by imposing the corresponding thresholds 
on Var(z), respectively, the overlaps can be increasingly 
reduced. In particular, when the optical data is combined 
with the NIR data such that those expected from the 
VIKING survey, the resulting subsamples have almost 
no overlap, if about 70% of galaxies are discarded. We 
should note that such a clean redshift binning can greatly 
reduce a possible contamination of the intrinsic elliptic- 
ity alignments arising from the physically close pairs o f 
galaxies in the similar redshifts ( Tak ada fc Whiteil2004l) . 

The lower plot shows the similar results for higher S/N 
samples with 22.5 < i < 25 whose scatter plots are seen 
in Figure [7] Again, if using the clipping method and 
having a wider coverage of wavelengths, a clean subsam- 
ple with almost no overlap between rcdshift bins can be 
obtained. 

As have been stressed several times, the weak lensing 
power spectra are, to the zero-th order approximation, 
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Fig. 8. — The shaded histograms are the redshift distributions of galaxies based on their photo- z information, i.e. the sharp cuts imposed 
on the photo-z's: < z p < 0.8, 0.8 < z p < 1.5 and z p > 1.5, indicated on the vertical dashed lines. In this case the horizontal axis denotes 
photo-z values. The solid-line histograms are the underlying true distributions, therefore the horizontal axis denotes true redshifts in this 
case. The upper and lower panels are the results using the galaxy catalogs in Figures [6] and [7] respectively. The different panels are for 
different galaxy selections as in Figures [6] and [7] 



sensitive to the mean redshifts of each redshift bins, and 
less to the statistical errors of photo-z 's or the detailed 
shape of redshift distribution. The level of systematic 
photo-z errors in each tomographic bins is quantified in 
Figure IH1 The central values and error bars in this plot 
show the bias in mean redshift and the statistical error of 
the mean redshift, <r((Az)), where Az is defined before 
(see around Fig. [5]) and the average (■ ■ • ) denotes the 
average over all the galaxies in the tomographic redshift 
bin 12 . Note that, for illustrative purpose, the error bars 

12 The statistical error of the mean redshift is reduced from the 



are scaled for a survey area of 1 arcmin 2 , and therefore 
the corresponding errors for our fiducial survey area of 
2000 sq. degrees are much smaller than plotted, by a 
factor of V2000 x 60 2 ~ 2700. 

The middle- and bottom-row panels are the results of 
subsamples obtained by further imposing the condition 
0.2 < z p < 1.5 or 22.5 < i < 25, respectively. It is 
clear that, without clipping ill-defined photo-z galaxies, 
the bias and errors are significant. Note that the red- 
typical photo-z error of each galaxy as <j((z}) ~ (j(z p ^)/N with N 
being the number of galaxies contained in the redshift bin. 
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shift bias for no tomography case sometimes becomes 
smaller than in some tomographic bins (especially high- 
est redshift bins), because photo-z outliers at low- and 
high-redshifts cancel out to some extent in no tomogra- 
phy case. As can be seen from the middle-row panels, 
when the restricted redshift range of 0.2 < z < 1.5 is 
considered, a subsamplc with most accurate photo-z's is 
obtained, because spectral features of galaxies, especially 
the Lyman and 4000 A breaks, are well captured by the 
sets of filters in this redshift range. 
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Fig. 9. — The bias between photometric and true redshifts, 
(Az), in each tomographic redshift interval. The error around 
each point is the statistical error of the mean redshift, a((Az)), 
in each redshift bin. For illustrative purpose, the errors are for 
a survey area of 1 sq. arcminutes, and the errors arc smaller 
by a factor V2000 X 60 2 ~ 2700 for our fiducial survey area of 
2000 sq. degrees. The panels in different columns show the re- 
sults for different sets of filters as indicated on the horizontal axis. 
The upper-row panels are for the subsamples where galaxies with 
i < 25.8 are selected only with the condition \t < 2. The middle- 
and bottom-row panels are for the subsamples where the condition 
0.2 < z p < 1.5 or 22.5 < i < 25 is further imposed for the selection, 
respectively. The round symbols in each panel show the results for 
the whole galaxy sample, while the triangle and square symbols are 
the results for the subsamples discarding 40% and 70% of galax- 
ies with ill-defined photo-z's, respectively, based on our clipping 
method. In each panel the four symbols are for different redshift 
intervals: "binl", "bin2", and "bin3" correspond to the lowest, 
medium and highest redshift bins in Figure \E\ and the leftmost 
symbols are for the case of no tomography, i.e. a single redshift 
interval. 

We next propagate the errors of tomographic redshifts 
into the dark energy parameter w, expected from the 
lensing power spectrum measurements, based on the 
Fisher matrix formalism described in § 14.21 To do this 
we address the following questions: 

• The trade-off of dark energy constraint: the statis- 
tical accuracy of w, <j(w), versus the offset of the 
best-fit value from the true value, Sw. 

• Study the dark energy trade-off against different 
subsamples of galaxies. 

These can be studied by using the simulated galaxy cat- 
alogs. The statistical error of w can be reduced by in- 
cluding more number of galaxies in the sample for a fixed 
range of working multipoles (5 < I < 3000), because the 
shot noise is more suppressed. On the other hand, a 
bias in the best-fit w due to the photo-z errors can be 



reduced by discarding photo-z outliers, leaving a fewer 
number of galaxies in the subsamplc. Hence a trad-off 
point in a(w) versus Sw may be found by compromising 
these competing effects. 

Figure [10] shows the marginalized error a(w) and the 
amount of bias \Sw\ as a function of number densities of 
galaxies included in the corresponding galaxy subsam- 
ples. Again note that we considered the lensing tomog- 
raphy with three tomographic bins for a sky coverage of 
2000 square degrees. The smaller number densities in 
the horizontal axis correspond to subsamples of galaxies 
where more galaxies with ill-defined photo-z's are dis- 
carded by imposing more stringent thresholds on Var(z), 
i.e. smaller threshold values of Var(z), in the clipping 
method. The different curves are for different combina- 
tions of filters. The error a(w) is computed from the un- 
derlying true redshift distribution, i.e. for the case with 
perfect photo-z's, therefore specified by the number den- 
sity in the horizontal axis. When Sw > cr(w), the best-fit 
value of w can be away from the true one by more than 
the lcr error; even if the true model has the cosmological 
constant (w = —1), the result of w ^ — 1 may be falsely 
inferred. Hence a minimal requirement on photo-z accu- 
racies can be assessed from the condition Sw < a{w). 

First, the plot shows that, as the subsample is re- 
stricted to galaxies with more accurate photo-z's, i.e. the 
smaller number densities, the bias in w is reduced to some 
extent. On the other hand, the error a(w) is only slightly 
degraded because the constraint comes mainly from the 
sample variance limited regime for a given range of work- 
ing multipoles (5 < I < 3000). 

It is also shown that the bias can be reduced by adding 
the NIR- and/or u-band data. However, the broadband 
data alone may not be sufficient to reduce the bias. The 
optimal range of redshifts needs to be considered, and a 
brighter subsample whose galaxies have higher S/N val- 
ues in each filter is preferred to sufficiently reduce the 
bias, as implied from the middle and right panels. De- 
pending on the available set of filters, the compromis- 
ing point can be obtained around the number densities 
h g = [10,30] axemin , i.e. more than 60% of ill-defined 
photo-z galaxies need to be discarded. It would also 
be worth noting that combining the lensing constraints 
with other dark energy probes such as the baryon acous- 
tic oscillation experiment may allow to further calibrate 
photo-z errors by breaking parameter degeneracies. 

We have so far paid special attention to how to elim- 
inate photo-z outliers in order to obtain a subsample of 
galaxies suitable for tomographic lensing measurements. 
However, due to the limitation of photo-z accuracies, 
there may remain a residual bias in the tomographic 
redshift bins, even if photo-z outliers are completely re- 
moved. To study this, Figure QT] shows the results ob- 
tained by artificially discarding photo-z outliers accord- 
ing to the clipping criteria 

logi±^>±i, (12) 
1 + z s 

with t = 0.1,0.3 and 0.5, respectively. The figure shows 
that a bias in w cannot be fully eliminated even if the 
outliers are completely discarded. This implies that there 
remains a residual bias in the mean redshift for each to- 
mographic bins due to asymmetric photo-z errors around 
z p = z s , therefore the residual biases would need to be 
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Fig. 10. — The forecasted constraints on the dark energy equation of state parameter w as a function of number densities of galaxies 
included in the corresponding galaxy catalogs, expected for the lensing tomography experiment with survey area of 2000 deg 2 in combination 
with the Planck CMB information. The three redshift bins are considered for each galaxy subsamplcs as in Figurc[8] The solid curve in each 
panel shows the marginalized error cr(w) assuming no photo-z errors. The other cu rves show the offset bias of the best-fit w from the true 
value (w = —1), computed by using the Fisher matrix formalism (see around Eq. 1111 ): the dotted, dashed, long-dashed and dot-dashed 
curves are for combinations of filters, grizy, ugrizy, grizyJHK and ugrizyJHK, respectively. The left, middle and right panels are the 
results for different object selections as in Figure [8] 
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Fig. 11. — As in the previous figure, but the results obtained by 
artificially discarding photo-z outliers given as log(l+Zp)/ (l+z s ) > 
±0.1, 0.3 and 0.5 (the three dotted curves from top to bottom, 
respectively). The filter combination grizy is considered in this 
plot. For comparison the top das hed curve in each panel is the 
same as the dotted curve in Figure [10] 

calibrated, e.g. by using a spectr oscopic training sub- 
sample (e.g. iMa fc Bernstein! [200l) . 

5.3. Angular cross-correlations of galaxies between 
different photo-z bins 

An alternative way to identify photo-z outliers is us- 
ing angular cross-co rrelations of ga l axies between dif- 
ferent photo-z bins (iNewmanl [2001 lErben et all 120091: 
IZhang et all 120091 : ISchubd 120091 ) . As implied in Fig. 
photo-z errors cause overlaps of galaxies between differ- 
ent redshift bins. Therefore photo-z errors may cause 
non-vanishing cross-correlations of galaxies between dif- 
ferent photo-z bins, if the galaxies indeed have similar 
true redshifts, therefore are physically correlated with 
each other. In other words the cross-correlations can, al- 
beit statistical, be used to monitor a contamination of 
photo-z outliers. In this subsection we use our simulated 
photo-z catalogs to estimate the expected signal-to-noise 
ratios for measuring the cross-correlations assuming the 
same survey parameters we have considered. 

Assuming the Limber approximation, the angular 
power spectra of galaxies in the i- and j-th photo-z bins 



are given as 



poo 

CffW = / dz 
Jo 



dz bibj rii{z)nj{z) — ft 



dx X 2 



Ps 



X 



(13) 



where n 2 (z) is the underlying true redshift distribution 
for the i-th photo-z bin and n% is its mean number den- 
sity. In the following we consider a sharp redshift binning 
in photo-z space, however, the underlying true distribu- 
tions generally have overlaps due to photo-z errors. We 
here simply assume that the galaxy distribution in the i- 
th bin is related to the matter distribution via constant 
bias parameter bi, which is taken to &, = 1 for all the 
photo-z bins for simplicity. To make this assumption 
reasonable, we restrict the following analysis to a range 
of low multipoles I < 500. Notice that the cross power 
spectra are not affected by shot noise. 

The strength of cross-correlations or redshift leakages 
can be quantified by the cross-correlation coefficients at 
each multipole: 



(14) 



The coefficient fiij ~ 1 implies significant cross- 
correlations between the i- and j-th bins compared 
to their auto-spectra, while fiij = means no cross- 
correlation or no leakage of photo-z outliers into different 
bins. The total signal-to-noise ratios expected for mea- 
suring the cross-correlations can be estimated as 



r 2 



c 



Cu Co 



(15) 



Here we simply assume the Gaussian covariances to 
model the statistical errors in measuring cross power 
spectra from a survey. Note that the error covariance 
includes the shot noise contamination via the auto spec- 
tra Cu and C^. As for the minimum and maximum 
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Fig. 12. — Angular cross-correlations between galaxies in two photo-z bins, denoted on the horizontal and vertical axes, for our fiducial 
survey with 2000 square degrees, adopting 35 redshift bins over < z p < 3.5 with the bin width dz = 0.1. As in Fig. [6] the left panel shows 
the result for galaxy catalog selected with ugrizy and x„ < 2, while the middle- and right-panels show the results for the catalogs where 
40% and 70% of galaxies wit h ill-defined photo-z's are discarded. The upper-left off-diagonal components in each panel show the correlation 
coefficients fiij (see Eq. 1141 ) at multipole i = 100. Compared with Fig. [6] one can find photo-z outliers cause "island" regions with high 
coefficients fiij ~ 1. The lower-right components show the cumulative signal-to-noise ratios (S/N) expected in measuring the cross-power 
spectrum over a range of multipoles 5 < i < 500. Again the cross-correlations between different bins, caused by photo-z outliers, show high 
S/N values greater than 10. 



multipoles used in the summation, we adopt £ m j n = 5 
and ^ max = 500, respectively. 

Figure [T2] studies the angular cross-correlations for our 
fiducial survey parameters with 2000 square degree cover- 
age. Here we consider the photo-z galaxy catalog selected 
with ugrizy and \1 < 2, and adopt 35 redshift bins over 
< z p < 3.5 with the bin width dz — 0.1 corresponding 
to a typical photo-z error on individual galaxy basis. The 
upper-left triangle in each panel shows the correlation co- 
efficients of cross-power spectra between the two differ- 
ent photo-z bins, Hij. The coefficients have large values 
around the diagonal terms, i.e. z pi ~ z p j, because the 
photo-z errors cause significant overlaps between neigh- 
boring redshift bins. Compared with the results in Fig. [SI 
one can find that photo-z outliers cause some isolated re- 
gions with significant correlation coefficients. 

The lower-right triangle shows the expected signal-to- 
noise ratios, S/N, for measuring cross-correlations. The 
sufficiently high S/N values, say greater than 10, can 
be expected for redshift bins that have high correlation 
coefficients. Thus monitoring the cross-correlations be- 
tween different photo-z bins may allow to further identify 
photo-z outliers, in a statistical sense. However, the gen- 
uine power of cross-correlation method for eliminating 
the outliers or calibrating the photo-z errors needs to be 
more carefully studied. 

Fina l ly we remark on a more quantitative work done in 
ISchulzl (|2009() . which studied, based on mock simulations, 
the use of cross-correlations of photometric galaxies with 
an overlapping spectroscopic sample to calibrate the red- 
shift distribution of the photometric galaxies without us- 
ing the photo-z information. While the main purpose 
of this paper is not using the cross-correlations to iden- 
tify photo-z outliers, the promising result shown is that 
the redshift distribution can be well reconstructed by us- 
ing a sufficiently large spectroscopic sample. However, 
one of the limiting factors realized is the reconstruction 
requires a sufficiently fine binning of spectroscopic red- 
shifts, which tends to make the cross-correlation mea- 
surements noisy. Therefore it would be interesting to 
study how the method can be further refined by combin- 
ing the cross-correlation method and the photo-z infor- 



mation, and the combined method may relax a require- 
ment on the size of spectroscopic ca libration sample. We 
also n ote that, as pointed out in iBernstein Sz Hutererl 
(|2009h . the cross-correlation method is affected by the 
lensing magnification bias, which may cause an apparent 
correlations between foreground and background galax- 
ies even if there is no photo-z errors to cause redshift 
overlaps. This effect also needs to be included, and mock 
simulations would be useful for such a study on the cross- 
correlation method. 

6. SUMMARY AND DISCUSSION 

In this paper we have studied how photo-z errors avail- 
able from broadband multi-color data affect cosmolog- 
ical parameter estimation obtained from tomographic 
lensing experiment. To do this, we made the simu- 
lated mock galaxy catalog with photo-z information con- 
structed from the COSMOS catalog. Since the photo-z 
errors are sensitive to survey parameters such as available 
filters, the depths, and so on, we considered in this paper 
the survey parameters to resemble the planned Subaru 
Hyper Suprime-Cam survey, which is characterized by 
the optical multi-passband data (grizy) and the depth 
i ,$ 26. We also studied how the photo-z accuracy can 
be improved if combining the optical data with the it- 
band data expected from a CFHT-type telescope and the 
NIR {JHK) data from the VIKING- type survey. How- 
ever, the method developed in this paper can be readily 
extended to other weak lensing surveys. 

We particularly paid our attention to how to construct 
a galaxy subsample suitable for weak lensing tomogra- 
phy. We showed that photo-z outliers can be efficiently 
identified by monitoring the posterior likelihood of red- 
shift estimation for each galaxy: more exactly, the width 
of likelihood function defined by the second moment 
around the best-fit redshift parameter (see Eq. [9]) was 
used as an indicator of the photo-z accuracy on individ- 
ual galaxy basis. It was also shown that the photo-z 
outliers can be more efficiently removed by restricting 
the ranges of working magnitudes and/or redshifts, and 
adding the u- and NIR bands (see Figs. [BHSJ. 

Using the Fisher matrix formalism, we estimated how 
the photo-z errors in a defined galaxy catalog cause bi- 
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ases in cosmological parameters, especially the dark en- 
ergy equation of state parameter w. It was shown how 
the parameter biases can be reduced with discarding 
galaxies with ill-defined photo-z estimates. However, 
with discarding more galaxies, the statistical accuracies 
of parameters are degraded due to the increased shot 
noise contamination. We found that the trade-off point, 
where the parameter bias becomes similar or smaller than 
the marginalized statistical error, can be achieved if a 
large fraction of ill-defined photo-z galaxies (~ 70%) are 
discarded and if combined with the u- and NIR-band 
data sets (Fig. HQ]). 

However, as demonstrated in Fig. Ill| even if photo-z 
outliers are completely eliminated, there may remain a 
non- negligible, residual bias in the mean redshift of each 
tomographic bin because the scatters around the relation 
between photometric and true redshifts, z p = z s , are not 
necessarily symmetric and therefore not perfectly can- 
celed even after the average of galaxies in each redshift 
bin. Therefore a careful calibration of the residual photo- 
z errors will be inev itably needed for any future surveys 
(jHearin et al.ll2010l) . 

A powerful method for the photo-z calibration is us- 
ing a training spectroscopic sub-sample. Naively, if a fair, 
representative spectroscopic subsample of imaging galax- 
ies used in the lensing analysis is available, it allows a 
calibration of photo-z errors. However, the size of such 
a spectroscopic sample required for achieving the mean- 
ingful dark energy constraint becomes very large; for a 
lensing survey with sky coverage of more than 1000 sq. 
degrees, containing more than 10 s imaging galaxies, a 
subsampl e with more than 10 6 spectroscopic redshifts is 
required (|Huterer et al.ll2006HMa et al.ll2006[ ). Note that 
the currently largest redshift sample is given by the COS- 
MOS project containing 10 4 redshifts. Thus a survey 
collecting 10 6 spectra, which is required for our sample 
fully calibrated, is observationally very expensive and al- 
most infcasiblc, especially if redshifts of fain t galaxies are 
needed (but see iBernstein fc Hutererl 120091 for relaxing 
the requirement). 

On the other hand, there i s a new method recently 
proposed (IMandelbaum et al.l 120081 : iLima et all 120081: 
ICunha et al.ll2009f ). using a spectroscopic subsample of 
smaller size, which is not necessarily a fair, representa- 
tive subsample of imaging galaxies. First spectroscopic 
galaxies are compared to imaging galaxies in multi- 
dimensional color space, rather than the photo-z space. 
Secondly the ratio between number densities of spectro- 
scopic and imaging galaxies is computed at each point 
in multi-color space. Then the ratio is multiplied to the 
redshift distribution of spectroscopic subsample to infer 
the underlying redshift distribution of imaging galaxies. 
Thus this weighting method may allow the photo-z cal- 



ibration using a spectroscopic subsample of smaller size. 
However, there is still an open issue to be carefully inves- 
tigated in this method. For example, it is unclear how 
the calibration degrades if the spectroscopic subsample 
has significant sample variance fluctuations in the red- 
shift distribution, e.g. due to clustering contamination 
at particular redshifts due to a finite area coverage. 

Another calibration method is using cross-correlations 
of galaxies in photo-z bins, as partly studied in Fig. [T2l 
Again the non-vanishing cross-correlations only arise 
when the photo-z errors cause leakages into different bins 
of true redshifts. Or spectroscopic galaxies in the same 
survey region, if available, can also be used to cross- 
correlate with imaging galaxies in order to calibrate the 
photo-z errors over a range of redshifts covered by the 
spectroscopic sample ([Newm an 2008; Schulz 20091). How- 
ever spectroscopic galaxies may be correlated with only 
particular types of galaxies, therefore, this method may 
have a limitation. Hence it would be interesting to ex- 
plore how to calibrate the redshift distribution of imaging 
galaxies down to the required accuracy level by combin- 
ing various methods, the method developed in this pa- 
per and the methods based on spectroscopic subsample 
or/and cross-correlation measurements. 

Finally we comment on another important contaminat- 
ing effect, the intrinsi c alignment in galaxy shap es 



iHirata fc Seliakl l2004t IMandelbaum et al 



As studied in detail in 



: y snapes (e.g 
I I20M 12009) 



King fc Schneider! (120031) (also 



see iHevmans fc Heavens] 120031 : iTakada fc White! l2004t 
iBridle fc Kingl l2007t ). accurate photo-z information is 
needed to calibrate and/or correct for the intrinsic align- 
ment contamination in weak lensing tomography. A sub- 
sample with reliable photo-z's, constructed based on the 
method in this paper, may be also useful for this purpose. 
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